
The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 




Application Serial Number: > <r . ^ . ^ y ^ 

Source: fCTOf. 
Date Processed by STIC: 



THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308 4212. 



FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216. 
PATENTIN 2.1 e-mail help: patin21help@uspto.gov or phone 703-306-4119 (R. Wax) 
PATENTED 3.0 e-mail help: patin30help@uspto.gov or phone 703-306-4119 (R. Wax) 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3.0 PROGRAM . ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW: 



Checker Version 3.0 

The Checker Version 3.0 application is a state-of the-art Windows based software program 
employing a logical and intuitive user-interface to check whether a sequence listing is in 
compliance with format and content rules. Checker Version 3.0 works for sequence listings 
generated for the original version of 37 CFR §§1.821 - 1.825 effective October I, 1990 (old 
rules) and the revised version (new rules) effective July 1, 1998 as well as World Intellectual 
Property Organization (WIPO) Standard ST.25. 

Checker Version 3.0 replaces the previous DOS-based version of Checker, and is Y2K- 
compliaht. Checker allows public users to check sequence listings in Computer Readable form 
(CRF) before submitting them to the United States Patent and Trademark Office (USPTO). 
Use of Checker prior to filing the sequence listing is expected to result in fewer errored sequence 
listings, thus saving time and money. 



Checker Version 3.0 can be down loaded from the USPTO website at the following address: 



http://www.uspto.gov/web/offices/pac/checker 



Page 1 of 8 



PCT09 



RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/701 f 395 



DATE : 01/23/2001 
TIME: 11:13:18 
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Input Set : A:\108172-00022.txt 

Output Set: N:\CRF3\01232001\l701395.raw 

<I10> APPLICANT: CUNNINGHAM JR., FRANCIS X. 
SUN, ZA1UEN 

<12(» TITLE OF INVENTION: GENES OF CAROTENOID BIOSYNTHESIS AND METABOLISM AND 

METHODS OF USE THEREOF 
<130> FILE REFERENCE: 8.172-902 3 

<140> CURRENT APPLICATION NUMBER: US/09/701,395 
<141> CURRENT FILING DATE: 2000-12-04 

<150> PRIOR APPLICATION NUMBER: 09/088,724 
0 5.1> PRIOR FILING DATE: 1998 -06 "02 
<150> PRIOR APPLICATION NUMBER: 09/088,725 
<151> PRIOR FILING DATE: 1998-06-02 
<16(» NUMBER OF SEQ ID NOS : 61 
<17 0> SOFTWARE : Pa ten tin Ver. 2.0 
<210> SEQ ID NO: 1 
<211> LENGTH: I860 
<2.12> TYPE: DNA 

<213> ORGANISM: Arabitfopsis thaliana 
<220> FEATURE: 
<221> NAME/KEY 
<22 2> LOCATION 
<400> SEQUENCE 



P 



, (1680) 



CDS 
(109) . 
1 

acaaaaggaa ataattagat tcctctttct gcttgctata ccttgataga acaatataac 

aatggtgtaa gtcttctcyc tgtattcgaa attatttgga ggaggaaa atg gag tgt 

Met Gin Cys 
1 

gtt ggg get agg aat ttc gca gca atg gcg gtt tea aca ttt ccg tea 
Val Gly Ala Arg Asn Phe Ala Ala Met Ala Val Ser Thr Pht?. Pro Ser 

5 10 15 

tgg agt tgt cga-agg aaa ttt oca gty gtt aag aga tac ago tat agg 
Trp Ser Cys Arg Arg Lys Phe Pro Val Val Lys Arg Tyr Ser Tyr Arg 
20 25 30 35 

aat att cgt ttc ggt ttg tgt agt gto aga get age ggc ggc gga agl 
Asn lie Arg Phe Gly Leu Cys Ser Val Arg Ala Ser Gly Gly Gly Ser 

40 45 50 

tec ggt agt gag agt tgt gta gcg gtg aga gaa gat ttc get gae gaa 
Ser Gly Ser Glu Ser Cys Val Ala Val Arg Glu Asp Phe Ala Asp Glu 

55 60 65 

gaa gat ttt gtg aaa get ggt ggt tct gag att eta ttt gtt caa atg 
Gin Asp Phe Val Lys Ala Gly Giy Ser Glu lie Leu Phe Val Gin Met 

70 75 80 

eag cag aac aaa gat atg gat gaa cag tel. aag ctt gtt gat aag ttg 
Gin Gin Asn Lys Asp Met Asp Glu Gin Ser Lys Leu Val Asp Lys Leu 

85 90 95 

ect cet ata tea att ggt gat ggt get ttg gat eat gtg gtt att. ggt 
Pro Pro Tie Ser Tie Gly Asp Gly Ala Leu Asp' His val Val He Gly 
100 105 110 115 

tgt ggt cet get ggt tta gee ttg get gca gaa tea yet aag ctt gga 



Does Not Comply 
Corrected Diskette Needer 



60 

117 



165 



213 



261 



309 



357 



405 



453 



501 
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RAW SEQUENCE LISTING DAT hi : 01/23/2001 

PATENT APPLICATION: US/09/701, 395 TIME: 11:1.3:18 



InpuL Set : A:\108172-00022.txt 

Output Set: N:\CRF3\01232O01\l701395.raw 



71 


Cys 


Gly 


Pro 


Ala 


Gly 


Leu 


Ala 


Leu 


Ala 


Ala 


Glu 


Ser 


Ala 


Lys 


Leu 


Gly 




12 










120 










125 










.130 






74 


tta 


aaa 


qtt 


qga 


etc 


att 


gqt 


cca 


gat 


ctt 


cct 


ttt 


act 


aac 


aat 


tac 


54 9 


75 


Leu 


L.ys 


val 


Gly 


Leu 


Tie 


Gly 


Pro 


Asp 


Leu 


Pro 


Phe 


TUr 


Asn 


Asn 


Tyr 




76 








135 










140 










145 








78 


99 1 


gtt 


tgy 


gaa 


gat 


gaa 


ttc 


aat 


gat 


ctt 


ygg 


ctg 


caa 


aaa 


tgt 


att 


59 7 


79 


Gly 


val 


T.rp 


Glu 


Asp 


Glu 


Phe 


Asn 


ASp 


Leu 


Gly 


Leu 


Gin 


Lys 


Cys 


He 




80 






150 










155 










1 GO 










82 


gag 


cat 


gtt 


tqg 


aga 


gag 


act 


att 


gtg 


tat 


ctg 


gat 


gat 


gac 


aag 


cct 


645 


83 


Glu 


His 


Val 


Trp 


Arg 


Glu 


Thr 


He 


Val 


Tyr 


Leu 


Asp 


Asp 


Asp 


Lys 


Pro 




84 




165 










170 










175 












86 


att 


acc 


att 


qgc 


cgL 


get 


tat 


gga 


aga 


gtt 


agt 


eg a 


cgt 


ttg 


etc 


cat 


693 


87 


lie 


Thr 


lie 


Gly 


Arg 


Ala 


Tyr 


Gly 


Arg 


Val 


Ser 


Arg 


Arg 


Leu 


Le.u 


His 




88 


180 










185 










190 










195 




yn 


gag 


qug 


CtL 


ttg 


ayg 


agg 


tgt 


gtc 


gag 


tea 


ggt 


gtc 


teg 


tac 


e tt 


age 


741 


91 


Glu 


Glu 


Leu 


Leu 


Arq 


Arg 


Cys 


Val 


Glu 


Ser 


Gly 


Val. 


Ser 


Tyr 


Leu 


Ser 




9 2 










200 










205 










210 






94 


Leg 


aaa 


gtt 


gac 


age 


ata 


aca 


gaa 


get 


tct 


gat: 


gge 


ctt 


aga 


ctt 


gtt 


789 


9 5 


Spr 


Lys 


Val 


Asp 


Ser 


He 


Thr 


Glu 


Ala 


Ser 


Asp 


Gly 


Leu 


Arg 


Leu 


Val 




9(i 








215 










220 










225 








98 


get 


tgt 


gac 


gac 


aat 


aac 


gtc 


att 


ccc 


tgc 


ago 


ctt 


gee 


act 


gtt 


get 


837 


99 


Ala 


Cys 


Asp 


Asp 


Asn 


Asn 


val 


He 


Pro 


Cys 


Arg 


Leu 


Ala 


Thr 


Val 


Ala 





10 0 






230 










235. 










240 










102 


tct 


gga 


gca 


get 


teg 


gga 


aag 


etc 


ttg 


caa 


tac 


gaa 


gtt 


gqt 


gga 


cot 


885 


10 3 


Ser 


Gly 


Ala 


Ala 


Ser 


Gly 


Lys 


Leu 


Leu 


Gin 


Tyr 


Glu 


Val 


Gly 


Gly 


Pro 




104 




245 










250 










255 












106 


aga 


g Lc 


tgt 


gtg 


caa 


act 


gca 


tac 


ggc 


gtg 


gag 


gtt 


gag 


gtg 


gaa 


aat 


93 3 


.10 7 


Arg 


Val 


Cys 


Val 


Gin 


Thr 


Ala 


Tyr 


Gly 


Val 


Glu 


Val 


Glu 


Val 


G.l u 


Asn 




108 


260 










265 










270 










275 




110 


agt 


cca 


tat 


gat 


cca 


gat 


caa 


atg 


gtt 


ttc 


atg 


gat 


tac 


aga 


gat 


tat 


981 


111 


Ser 


P ro 


Tyr 


Asp 


Pro 


Asp 


Gin 


Met 


Val 


Phe 


Met 


Asp 


Tyr 


Arg 


Asp 


Tyr 




112 










280 










285 










290 






114 


act 


aac 


gag 


aaa 


gtt 


egg 


age 


tta 


gaa 


get 


gag 


tat 


cca 


acg 


ttt 


ctg 


1029 


115 


Thr 


Asn 


Glu 


Lys 


Val 


Arg 


Ser 


Leu 


Glu 


Ala 


Glu 


Tyr 


Pro 


Thr 


Phe 


Leu 




116 








295 










300 










305 








.118 


tac 


gee 


atg 


cct 


atg 


aca 


aag 


tea 


aga 


etc 


ttc 


ttc 


gag 


gag 


aca 


tgt 


1077 


119 


Tyr 


Ala 


Met 


Pro 


Met 


Thr 


Lys 


Ser 


Arg 


Leu 


Phe 


Phe 


Glu 


Glu 


Thr 


Cys 




120 






310 










315 










320 










122 


ttg 


gee 


tea 


aaa 


gat 


gtc 


atg 


ccc 


ttt 


gat 


ttg 


eta 


aaa 


aeg 


aag 


etc 


1125 


12 3 


Leu 


Ala 


Ser 


Lys 


Asp 


Val 


Met 


Pro 


Phe 


Asp 


Leu 


Leu 


Lys 


Thr 


Lys 


Leu 




125 




325 










330 










335 












127 


atg 


tta 


aga 


tta 


gat 


aca 


etc 


gga 


att 


cga 


att 


eta 


aag 


act 


tac 


gaa 


1173 


128 


Met 


Leu 


Arg 


Leu 


Asp 


Thr 


Leu 


Gly 


He 


Arg 


He 


Leu 


Lys 


Thr 


Tyr 


Glu 




129 


340 










345 










350 










355 




131 


gag 


gag 


tgg 


tec 


tat 


ate 


cca 


gtt 


ggt 


ggt 


tec 


ttg 


cca 


aac 


acc 


gaa 


122.1 


132 


Glu 


Glu 


Trp 


Ser 


Tyr 


He 


Pro 


Val 


Gly 


Gly 


Ser 


Leu 


Pro 


Asn 


Thr 


Glu 




133 










36 0 










365 










370 






13 5 


caa 


aag 


aat 


etc 


gec 


ttt 


ggt 


get 


gee 


get 


age 


atg 


gta 


cat 


ccc 


gca 


1269 


136 


Gin 


Lys 


Asn 


Leu 


Ala 


Phe 


Gly 


Ala 


Ala 


Ala 


Ser 


Met 


val 


His 


Pro 


Ala 
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137 








375 










380 










3R5 








139 


aca 


ggc 


tat 


tea 


gtt 


gtg 


aga 


tct 


ttg 


tct 


gaa 


get 


eca 


aaa 


tat 


gca 


1317 


140 


Thr 


(My 


Tyr 


Ser 


Val 


val 


Arg 


Ser 


Leu 


Ser 


Gl u 


Ala 


Pro 


Lys 


Tyr 


Ala 




141 






390 










395 










400 










14 3 


tea 


gtc 


ate 


gca 


gag 


ata 


eta 


aga 


gaa 


gag 


act 


ace 


aaa 


eag 


ate 


aae 


136 5 


1-14 


Ser 


Val 


He 


Ala 


Glu 


He 


Leu 


Arg 


Glu 


Glu 


Thr 


Thr 


Lys 


Gl n 


He 


AStl 




14 5 




405 










410 










415 












147 


agt 


aat 


att 


tea 


aga 


caa 


get 


tgg 


gat 


act 


tta 


tgg 


eca 


era 


gaa 


agg 


1413 


148 


Ser 


Asn 


Ik 


Ser 


Arg 


Gin 


Ala 


Trp 


Asp 


Thr 


Leu 


Trp 


Pro 


Pro 


Glu 


Arg 




14 9 


420 










425 










430 










435 




l r >l 


aaa 


aga 


eag 


aga 


gca 


I to 


ttt 


ete 


ttt 


ggt 


ett 


gca 


etc 


ata 


gtt 


caa 


1461 


.152 


L.ys 


Arg 


Gin 


Arg 


Ala 


Phe 


Phe 


Leu 


Phe 


Gly 


Leu 


Ala 


Leu 


He 


Val 


Gin 




153 










440 










445 










450 






155 


ttc 


gat- 


ace 


gaa 


gge 


att 


aga 


age 


ttc 


ttc 


egt 


act 


ttc 


ttc 


cgc 


ett 


1509 


150 


Phe 


Asp 


Thr 


Glu 


Gly 


lie 


Arg 


Ser 


Phe 


Phe 


Arg 


Thr 


Phe 


Phe 


Arg 


Leu 




157 








4 55 










460 










465 








159 


ecu 


aaa 


tgg 


atg 


tgg 


caa 




ttt 


eta 


qga 


tea 


aea 


tta 


aca 


tea 


gga 


1557 


160 


Pro 


Lys 


Trp 


Met 


Trp 


CI u 


Gly 


Phe 


Leu 


Gly 


Ser 


Thr 


Leu 


Thr 


Ser 


Gly 




161 






470 










475 










480 










163 


qat 


etc 


qtt 


etc 


ttt 


get 


tta 


tae 


atg 


ttc 


gtc 


att 


tea 


eca 


aac 


aat 


1605 


164 


Asp 


Leu 


Val 


Leu 


Phe 


Ala 


Leu 


Tyr 


Met 


Phe 


Val 


lie 


Ser 


Pro 


Asn 


Asn 




1.6 5 




4 85 










4 90 










495 












16 7 


ttg 


aga 


aaa 


ggt 


etc 


ate 


aat 


cat 


etc 


ate 


tct 


gat 


eca 


acc 


99 a 


gca 


16 5 3 


168 


Leu 


Arg 


Lys 


Gly 


Leu 


He 


Asn 


His 


Leu 


He 


Ser 


Asp 


Pro 


Thr 


Gly 


Ala 




169 


500 










505 










510 










515 




171 


acc 


a tig 


ata 


aaa 


ace 


tat 


etc 


aaa 


y ta 


tgatttaett ateaaetett 




1700 


172 


Thr 


Met 


lie 


Lys 


Thr 


Tyr 


Leu 


Lys 


Val 


















.17 3 










520 


























175 


aggtttgtgt atatatatgt tgatttatct; 


: gaataatcga 


tcaaagaatg gtatgtgggt 


1760 


177 


tactaggaag ttggaaaeaa acatgtatag aatctaagga 


gtgatcgaaa tggagatgga 


1820 


:i 7 9 


aacgaaaaga aaaaaatcag tctttgtttt gtggttagtg 












1860 


1.82 


<21.0> SEQ ID NO: 


: 2 


























18 3 


<211> LENGTH: 524 


























184 


<212> TYPE: 


PRT 




























18 5 


<213> ORGANISM: 


Arabidopsis 


thai Lana 
















187 


<4 00> SEQUENCE: 


2 


























.108 


Met 


Glu 


Cys 


Val 


Gly 


Ala 


Arg 


Asn 


Phe 


Ala 


Ala 


Met 


Ala 


Val 


Ser 


Thr 




189 


1 








5 










10 










15 






.191 


Plie 


Pro 


Ser 


Trp 


Ser 


Cys 


Arg 


Arg 


Lys 


Phe 


Pro 


Val 


Val 


Lys 


Arg 


Tyr 




19 2 








20 










25 










30 








194 


Ser 


'j-y r 


Arg 


Asn 


lie 


Arg 


Phe 


Gly 


Leu 


Cys 


Ser 


Val 


Arg 


Ala 


Ser 


Gly 




195 






3 5 










40 










45 










197 


Gly 


Gly 


Ser 


Ser 


Gly 


Ser 


Glu 


Ser 


Cys 


Val 


Ala 


val 


Arg 


Glu 


Asp 


Phe 




198 




50 










55 










60 












200 


Ala 


Asp 


Glu 


Glu 


Asp 


Phe 


Va 1 


Lys 


Ala 


Gly 


Gly 


Ser 


Glu 


He 


Leu 


Phe 




201 


65 










70 










75 










80 




20 3 


Val 


Gin 


Met 


Gin 


Gin 


Asn 


Lys 


Asp 


Met 


Asp 


Glu 


Gin 


Ser 


Lys 


Leu 


Val 




204 










85 










90 










95 






206 


Asp 


Lys 


Leu 


Pro 


Pro 


Tie 


Ser 


Tie 


Gly 


Asp 


Gly 


Ala 


Leu 


Asp 


His 


Val 
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207 








100 










105 










110 






209 


val 


lie 


Gly 


Cys 


Gly 


Pro 


Ala 


Gly 


Leu 


Ala 


Leu 


A 1. a 


Ala 


Glu 


Se r 


Aid 


210 






1 1.5 










1 20 










12 5 








21 2 


LYS 


Leu 


Gly 


Lett 


Lys 


va I 


G Ly 


l.eu 


He 


Gly 


Pro 


Asp 


Leu 


Pro 


P h e 


Thr 


213 




.130 










13 5 










14 0 










213 


ash 


Asn 


Tyr 


ciy 


val 


Trp 


Glu 


Asp 


Glu 


Phc 


Asn 


ASp 


Leu 


Gly 


Leu 


CI a 


216 


14 5 










150 










155 










160 


2 1 B 


Lys 


Cys 


V le 


Glu 


Hi.s 


Va I 


Trp 


Arq 


Glu 


Thr 


1 1 o 


Val 


Tyr 


Leu 


Asp 


Asp 


219 








165 










170 










17 5 




221 


ASp 


Lys 


Pro 


lie 


Thr 


lie 


Gly 


Arg 


Ala 


Tyr 


Gly 


Arg 


Val 


Ser 


Arg 


Arg 


222 








180 










185 










190 






224 


Leu 


Leu 


His 


Glu 


Glu 


l.eu 


Leu 


Arg 


Arg 


Cys 


Val 


Glu 


Ser 


G 1 y 


Va .1 


Ser: 


225 






.195 










200 










205 








227 


Tyr 


Leu 


Ser 


Sin- 


Lys 


Val 


Asp 


Ser 


Tie 


Thr 


G l.u 


Ala 


Ser 


Asp 


Gly 


Leu 


228 




210 










215 










220 










2 30 


Arq 


Leu 


Val 


Ala 


Cys 


Asp 


Asp 


Asn 


Asn 


val 


1 le 


Pro 


Cys 


Arg 


Leu 


Al a 


231 


225 










23 0 










23 5 










24 0 


233 


Thr 


Val 


Ala 


SGI' 


Gly 


Ala 


Ala 


Ser 


Gly 


Lys 


Leu 


Leu 


Gin 


Tyr 


Glu 


Val 


234 










245 










2 50 










255 




236 


Gly 


Gly 


Pro 


Art] 


Val 


Cys 


Val 


Gin 


Thr 


Ala 


Tyr 


Gly 


Val 


Glu 


Val 


Glu 


237 








260 










265 










270 






2 39 


Va L 


Glu 


Asn 


Ser 


Pro 


Tyr 


Asp 


Pro 


Asp 


Gin 


Met 


Va 1 


Phe 


Met 


Asp 


Tyr 


240 






275 










280 










285 








242 


Arg 


Asp 


Tyr 


Thr 


Asn 


Glu 


Lys 


Val 


Arg 


Ser 


Leu 


Glu 


Ala 


Glu 


Ty r 


Pro 


24 3 




290 










29 5 










300 










245 


Thr 


Phe 


Leu 


Tyr 


Ala 


Met 


Pro 


Met; 


Thr 


Lys 


Ser 


Arg 


Leu 


Phe 


Phe 


Glu 


2 4 0 


305 










3 1.0 










31 5 










320 


249 


Glu 


Thr 


Cys 


Leu 


Ala 


Ser 


Lys 


Asp 


val 


Met 


Pro 


Phe 


Asp 


Leu 


Leu 


Lys 


250 










325 










330 










335 




252 


Thr 


Lys 


Leu 


Met 


Leu 


Arg 


Leu 


Asp 


Thr 


Leu 


Gly 


He 


Arg 


I le 


Le u 


Lys 


253 








34 0 










345 










350 






255 


Thr 


Tyr 


Glu 


Glu 


Glu 


Trp 


Ser 


Tyr 


Tie 


Pro 


Val 


Gly 


Gly 


Ser 


Leu 


Pro 


256 






355 










360 










36 5 








259 


Asn 


Thr 


Glu 


Gin 


Lys 


Asn 


Leu 


Ala 


Phe 


Gly 


Ala 


Ala 


Ala 


Ser 


Me t 


Val 


259 




3 70 










375 










380 










26.1 


His 


Pro 


Ala 


Thr 


Gly 


Tyr 


Ser 


Va 1. 


Val 


Arg 


Ser 


Leu 


Ser 


Glu 


Ala 


Pro 


2G2 


38 5 










390 










39 5 










40 0 


26 4 


Lys 


Tyr 


Ala 


Ser 


val 


He 


Ala 


Glu 


He 


Leu 


Arg 


Glu 


Glu 


Thr 


Thr 


Lys 


26 5 






4 05 










410 










4.15 




26 7 


Gin 


He 


Asn 


Ser 


Asn 


He 


Ser 


Arg 


Gin 


Ala 


Trp 


ASp 


Thr 


Leu 


Trp 


Pro 


268 








420 










425 










430 






270 


Pro 


Glu 


Arg 


Lys 


Arq 


Gin 


Arg 


Ala 


Phe 


Phe 


Leu 


Phe 


G ly 


Leu 


Ala 


Le u 


271 






43 5 










44 0 










44 5 








273 


lie 


Val 


Gin 


Phe 


Asp 


Thr 


Glu 


Gly 


Tie 


Arg 


Ser 


Phe 


Phe 


Arg 


Thr 


Phe 


274 




450 










455 










4 60 










276 


Phe 


Arg 


Leu 


Pro 


Lys 


Trp 


Met 


Trp 


Gin 


Gly 


Phe 


Leu 


Gly 


Ser 


Thr 


Leu 


277 


465 










470 










475 










480 


279 


Thr 


Ser Gly 


Asp 


f.eu 


Val 


Leu 


Phe 


Ala 


Leu 


Tyr 


Met 


Phe 


Va 1 


He 


Ser 


280 










485 










490 










495 
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RAW SEQUENCE LISTING DAT 13 : 01/23/2001 

PATENT APPLICATION: US/09/701,395 TIME: 11:13:18 

Input Set : A:\108172-00022.txt 

Output Set: N:\CRF3\01232001\I701395.raw 

282 Pro Asn Asn Leu Art) Lys Cly Leu lie Asn His Leu lie Ser Asp Pro 

283 500 505 310 
285 Thr Gly Ala Thr Met lie Lys Thr Tyr: Leu Lys vai 

28G 515 520 

288 <210> SEQ ID NO: 3 

289 <211> LENGTH: 9 56 

290 <212> TYPE : UNA 

291 <2.L3> ORGANISM: A rabid ops i s t.lia liana 
29 3 <4 00> SEQUENCE : 3 

-> 294 gctctttctc ctcctcctct accgatttcc gactccgcct cccgaaatcc ttatccggat 



60 



296 tctcteegtc tcttcgatLL aaacgctttL c tg Lctgtfa eg tcgtegaa gaacggagac 120 

298 agaattctcc gattgagaac qatqagagac cggagaqcac gagctccaca aacgctataq 18 0 

300 acgctgagta r.ctggcgtfg cgtttggcgg agaaattgga gaggaagaaa tcggagaggt 240 
302 ccacf.tar.ct aatcgctgct atgttgtcga gcf.ttggt.at caettetatg gctgttatgg 3 00 

301 ctytttacta cagaUtcLct tgguaaatgg agggaggtga gatctuaatg ttygaaatg t 360 
306 ttggtacatt tgctotetct gttggtgctg etgttggtat ggaattotgg geaagatggg 4 20 
308 c teat a gage tctytggeao go tt. etc tat ggaatatged v.gagtcacat. eacaaaccaa 480 
311 gagaaggacc gtttgagcta aacga tgf.l: t Ltgetatagt gaacgctggt ecaqegattg 54 0 
313 gtctcctctc ttatggattc ttcaatanag gactcgtt.ee tggtctctgc tttggcgceg 600 
315 ggttaggcat aacggtgttt ggaatcgect acatgtttgt ecacgatggt ctcgLgcaca G60 
317 agcgtttccc tgtaggtccc ategcegaeg tcccttaccL ccgaaaggtc gccgccgctc 7 20 
319 accagotaca tcacacagac aagttoaa tg gtgtaccat.a tggact.gttt cLtggaocca 78 0 
321 aggaattgga agaagttgga. ggaaatgaag agttagar.aa ggagattagt eggagaatea 84 0 
323 aatcatacaa aaaggecteg ggctecgggt egagttcgag ttcttgactt taaacaagtt 900 
325 ttaaatccca aattctt.ttt ttgtottctg tcattatgat catcttaaga cggt.ct 956 
327 <210> SEQ ID NO : 4 

3 28 <211> LENGTH: 294 

329 <212> TYPE: PRT 

330 <213> ORGANISM: Arabidopsis tha liana 
332 <400> SEQUENCE: 4 

3 33 Ser Phe Ser Ser Ser Ser Thr Asp Phe Arg a-u Arg Lou Pro Lys Ser 

334 1 5 10 :15 

336 Leu, Ser Gly Phe Ser Pro Ser Leu Arg Phe Lys Arg Phe Ser Vai Cys 

337 * 20 . 25 30 

339 Tyr Vai Vai. Glu Glu Arg Arg G.tn Asn Ser Pro lie Glu Asn Asp Glu 

34 0 3 5 40 ■ 4 5 

342 Arg Pro Glu Ser Thr Ser Ser Thr Asn Ala Tie Asp Ala Glu Tyr Leu 

343 * 50 55 60 

345 Ala Leu Arg Leu Aia Glu Lys Leu Glu Ary Lys Lys Ser Glu Arg Ser 

346 65 70 75 80 

348 Thr Tyr Leu lie Ala Ala Met Leu Ser Ser Phe Gly Tie Thr Ser Met 

349 85 90 95 

351 Ala Vai Met Ala Vai Tyr Tyr Arg Phe Ser Trp Gin Met Glu Gly Gly 

352 100 1.05 110 

354 Glu lie Ser Met Leu Glu Met Phe Gly Thr Phe Ala Leu Ser Vai Gly 

355 115 120 125 

357 Ala Ala Vai Gly Met Glu Phe Trp Ala Arg Trp Ala His Arg Ala Leu 

358 130 135 140 

360 Trp His Ala Ser Leu Trp Met Asn His Glu Ser His His Lys Pro Arg 
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<210> 13 
<211>. 960 
<212> DNA 
■* <213> Tagetes erecta 

<400> 13 



^tflArdr, PiLu-suu cj/wufh 



CCAAAAACAA 


CTCAAATCTC 


CTCCGTCGCT 


CTTACTCCGC 


CATGGGTGAC 


GACTCCGGCA 


60 


TGGATGCTGT 


TCAGCGACGT 


CTCATGTTTG 


ACGATGAATG 


CATTTTGGTG 


GATGAGTGTG 


120 


ACAATGTGGT 


GGGACATGAT 


ACCAAATACA ATTGTCACTT 


GATGGAGAAG 


ATTGAAACAG 


180 


GTAAAATGCT 


GCACAGAGCA 


TTCAGCGTTT 


TTCTATTCAA 


TTCAAAATAC 


GAGTTACTTC 


240 


TTCAGCAACG 


GTCTGCAACC 


AAGGTGACAT 


TTCCTTTAGT 


ATGGACCAAC 


ACCTGTTGCA 


300 


GCCATCCACT 


CTACAGAGAA 


TCCGAGCTTG 


TTCCCGAAAC 


GCCTGAGAGA 


ATGCTGCACA 


360 


GAGGgj^NNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN ^ 


I 420 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


480 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


540 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


600 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


660 


NNNNNNNNNN 


NNNNNNNNNN^TCATGTGCAA AAGGGTACAC 


TCACTGAATG 


CAATTTGATA 


720 


TGAAAACCAT 


ACACAAGCTG 


ATATAGAAAC 


ACACCCTCAA 


CCGAAAAGCA 


AGCCTAATAA 


780 


TTCGGGTTGG 


GTCGGGTCTA 


CCATCAATTG 


TTTTTTTCTT 


TTAACAACTT 


TTAATCTCTA 


840 


TTTGAGCATG 


TTGATTCTTG 


TCTTTTGTGT 


GTAAGATTTT 


GGGTTTCGTT 


TCAGTTGTAA 


900 


TAATGAACCA 


TTGATGGTTT 


GCAATTTCAA 


GTTCCTATCG 


ACATGTAGTG 


ATCTAAAAAA 


960 



61 

Please Note: 

Use of n and/or Xaa have been detected in the Sequence Listing. Please review the 
Sequence Listing to ensure that a corresponding explanation is presented in the <220> to 
<223> fields of each sequence which presents at least one n or Xaa. 



VERIFICATION SUMMARY 

PATENT APPLICATION: US/09/701,395 



DATE: 01/2 3/20 01 
TIME: 11: 13: 10 



Input Sot : A:\108172-00022.txt 

Output Set: N:\CRF3\01232001\l701395.raw 



L;12 M:270 C: Current Application Number differs, Replaced Current Application Number 
L:J3 M:27l C: Current Filing Date differs, Replaced Current Filing Date 
J..::i5 M:i:t2 C: (-18) String data converted to lover case, 



M: 


:l.t2 


Repeated 


in SoqNO 1 *: 


1 


















L: 


: 2 9-1 


M : 


112 


C: 


(48) String data converted 


to lover case, 










M: 


: 112 


Repeated 


in SeqNo : : 


3 


















f: 


5 54 


M : 


: 1 12 


(.: : 


(48) String data converted 


to lower ease, 










M : 


112 


Repeated 


in SeqNo»' 


9 


















1.: 


594 


M: 


112 


C : 


(48) Stritiq data converted 


to lower case, 










M: 


.112 


Repeated 


in SeqNo 1 '. 


10 


















L: 


b 3 C) 


M : 


112 




(48) String data converted 


to low«r case, 










M: 


112 


Repeated 


in SeqNo 1 *: 


1.1 


















1.: 


684 


M : 


112 


C: 


(48) String data converted 


to lower case, 










M: 


1 1 2 


Repeated 


in SeqNo-: 


12 


















L: 


729 


M : 


; 112 


C : 


(48) SLring data converted 


to lower case, 










M: 


1 .1 7 


Repeated 


in SeqNo*-* 


13 


















L: 


74 1 


M : 


: 258 


W: 


Mandatory 


Feature 


missing , 


<220> 


not 


found 


for 


SEO 


ID ff 


: 13 


L; 


741 


M: 


258 


W: 


Mandatory 


Fea ture 


missing , 


<221> 


not 


found 


tor 


SEQ 


IDfc 


:13 


T,: 


7 4.1 


M: 


: 258 


W: 


Mandatory 


Feature 


missing , 


<222> 


not 


found 


for 


SEQ 


I D if 


: 13 


L ; 


741 


M : 


: 258 


W: 


Mandatory 


Feature 


missing , 


<22 3> 


not 


found 


for 


SEQ 


ID# 


:13 


L: 


7 4 1 


M : 


: 34 0 


W: 


(46) "n" < 


jr "Xaa" 


used: Feature required, for SEQ ID#:; 


13 


L: 


74.1 


M: 


: 258 


W: 


Mandatory 


Feature 


missing , 


<220> 


not 


found 


for 


SEQ 


IDft 


: 13 


T.: 


74 3 


M: 


258 


W: 


Mandatory 


Feature 


missing , 


<221> 


not 


fou nd 


for 


SEQ 


TD# 


: 13 


h: 


743 


M : 


258 


W: 


Mandatory 


Fea ture 


missing , 


<222> 


not 


found 


for 


SEQ 


IDlf 


: 13 


L : 


; 743 


M: 


: 258 


W: 


Mandatory 


Feature 


miss lug , 


<223> 


not 


found 


for 


SEQ 


lDft 


: .1 3 


H: 


34 0 


Repea t 


.ed 


in SeqNo*; 


13 


















T,: 


7 4 5 


M : 


: 258 


W: 


Mandatory 


Fea Lure 


missing , 


<220> 


not 


found 


for 


SEQ 


TDtf 


: 13 


L: 


7 4 f) 


M : 


258 


W: 


Mandatory 


Feature 


missing , 


<221> 


not 


fou nd 


for 


SEQ 


ID ft 


: 13 


L: 


74 5 


M: 


; 258 


W: 


Mandatory 


Fea Lu r e 


missing , 


<222> 


not 


found 


for 


SEQ 


TDK 


:13 


L: 


745 


M: 


25 8 


W: 


Mandatory 


Fea Lure 


missing , 


<223> 


not 


found 


for 


SEQ 


IDS 


: 13 


L: 


747 


H : 


: 258 


W: 


Mandatory 


Fea Lure 


missing , 


<220> 


not 


found 


for 


SEQ 


ID ft 


: 13 


Tj : 


747 


M : 


: 258 


W: 


Mandatory 


Feature 


missing , 


<221> 


not 


fou nd 


for 


SEQ 


TDfi 


: 13 


L: 


74 7 


M : 


;258 


W: 


Mandatory 


Fea ture 


missing , 


<222> 


not 


found 


for 


SEQ 


IDS 


: 13 


L: 


74 7 


M : 


:258 


W: 


Manda Lory 


Feature. 


missing , 


<2 2 3> 


not 


found 


for 


SEQ 


ID if 


: 13 


L: 


7 4 9 


M : 


:258 


W: 


Mandatory 


Fea ture 


missing , 


<2 20> 


not 


found 


for 


SEQ 


IDs 


: 13 


Ti : 


749 


M : 


: 258 


W: 


Mandatory 


Feature 


missing , 


<221> 


not 


found 


for 


SEQ 


TDfl 


: 13 


I, : 


: 74 9 


M : 


: 258 


W: 


Mandatory 


Fea ture 


missing - 


<222> 


not 


found 


for 


SEQ 


IDfi 


: 13 


L: 


7 4 9 


M ; 


;258 


W: 


Mandatory 


Fea ture 


missing , 


<22 3> 


not 


found 


for 


SEQ 


IDS 


: 13 


L: 


: 7 5 1 


M ; 


:25 8 


V7: 


Mandatory 


Feature 


missing , 


<22()> 


not. 


fou nd 


for 


SEQ 


IDS 


: 13 


Tj : 


7 51 


M: 


:258 


W: 


Mandatory 


Fea ture 


missing , 


<221> 


not 


found 


for 


SEQ 


TD & 


: 13 


L: 


: 751. 


M: 


:258 


W: 


Mandatory 


Feature 


missing, 


<222> 


not 


found 


for 


SEQ 


IDS 


:13 


L: 


;751 


M : 


: 258 


W: 


Mandatory 


Feature 


missing , 


<22 3> 


not 


found 


for 


SEQ 


IDS 


:13 


L: 


1341 H:112 C 


: (48) String data 


converted to lower 


case, 










M: 


.112 


Repea Led 


in SeqNo- 


22 


















L: 


; 1521 M : .1 1 2 C 


: (48) String data 


converted to lover 


case , 










M : 


:112 


Repeated 


in SeqNo= 


24 



















L:lf>34 M:258 W: Mandatory Feature missing, <220> not found for SEQ lDit:25 
L:16 34 M.-258 W: Mandatory Feature missing, <221> not found for SEQ ID#:25 
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VERIFICATION SUMMARY DATE: 01/2 3/20 01 

PATENT APPLICATION : US/09/7 01, 39 5 TIME : 11: 1.3 : 19 



Input Set : A:\108172-00022.txt 

Output Set: N:\CRF3\01232001\I701395.raw 



L:1634 
L : .1 6 3 4 
L:1G34 
L:1743 
L:1743 
L:174 3 
L:1743 
L:1743 
L: 1840 



2 58 W 
:2 58 W 
: S 4 U W 
2 58 W 
258 W 
258 W 
2 58 W 
'J 4 0 W 
112 C 



M:112 Repeated 
L: 1881 M: 112 C : 
M:1'12 Repeated 
L:1931 M:112 C: 
M: I 12 Repeated 
L:1975 M:112 C: 
Mill 2 Repeated 
L:2015 M:I12 C: 
M;112 Repeated 

L:2055 M: 112 C : 
M;112 Repeated 
L: 2.1 25 M: 112 C: 
M:112 Repeated 
L:2163 M: 1.12 C : 
M:112 Repeated 
L:2782 M:112 C: 



J 12 Repeated 
3019 M:258 W 
3019 M:258 W 
3019 M:258 W 
3 019 M:258 W 
3019 M:340 W 



Mandatory feature 
Mandatory Feature 
(46) "n" or "Xaa" 
Mandator y Pe a t u re 
Mandator- y Fe a t u re 
Ha tida L o r y Pe a t u re 
Ma ad a Lory Pea Lu r e 
(4T)) "n" or "Xaa" 
(48) String data 

in SeqNo-23 
(48) String data 

in SeqNo-29 
(4 8) String data 

in SeqNo-^30 
(48) String data 

in SeqNo----3.1 
(48) String data 

in SeqNo--32 
(48) String data 

in SeqNo=33 
(4 8) String data 

in SeqNo-34 
(48) String data 

in SeqNo=3 5 
(48) String data 

in SeqNo-46 
Ma iuia t o i: y Pe a t u r e 
Ma n da tor y Pe a t u re 
Mandatory Feature 
Mandatory Feature 
(46 ) "n" or "Xaa" 



for SEQ IDs 
for SEQ TO if 

for SEQ lUff : 
tor SEQ im 
for SEQ I Drr 
for SEQ iDii 
for SEQ IDs 

for SEO IDS : 



missing, <222> not found 
missing, <2 2 3> not found 
used: Feature required, 
missing, <2 20> not found 
missing, <221> not found 
missing, <222> not. found 
missing, <2 23> not. found 
used: Feature required, 
converted to lower case, 

converted to lower case, 

converted to lower case, 

converted to lower ease, 

converted to lower case, 

converted to lower case, 

converted to lower case, 

converted to lower case, 

converted to lower case, 



missing, <.2 2 0> not found for SEQ ID If 
missing, <2 2.l> not found for SEQ "ID If 
missing, <22 2> not found for SEO ID if 
missing, <223> not found for SEQ iDtf 
used: Feature required, for SEQ IDS: 



:25 
: 25 
2 5 
:26 
: 26 
: 26 
:26 
26 



: 4 8 
:48 
: 48 
: 48 
48 
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